GMM-Based Identification of Indonesian Speech

نویسندگان

  • Amalia Zahra
  • Julie Carson-Berndsen
چکیده

This paper reports the performance of identification of Indonesian speech within a ten-language corpus: English, German, Hungarian, Indonesian, Italian, Korean, Mandarin, Polish, Portuguese, and Swedish. The tasks are performed by implementing Gaussian Mixture Model (GMM) on MelFrequency Cepstral Coefficients (MFCCs). Two types of experiments that have been undertaken: pair-wise and tenlanguage experiments. In the pair-wise experiments, the performance of the model in identifying Indonesian within every language pair is evaluated. The experiments show that Indonesian is best distinguished from English, Korean, and Portuguese with 90.5% of accuracy. In ten-language experiments, the highest accuracy of identifying Indonesian is 85.71%.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

مقایسه روش های طیفی برای شناسایی زبان گفتاری

Identifying spoken language automatically is to identify a language from the speech signal. Language identification systems can be divided into two categories, spectral-based methods and phonetic-based methods. In the former, short-time characteristics of speech spectrum are extracted as a multi-dimensional vector. The statistical model of these features is then obtained for each language. The ...

متن کامل

Using Context-based Statistical Models to Promote the Quality of Voice Conversion Systems

This article aims to examine methods of optimizing GMM-based voice conversion systems performance in which GMM method is introduced as the basic method for improvement of voice conversion systems performance. In the current methods, due to using a single conversion function to convert all speech units and subsequent spectral smoothing arising from statistical averaging, we will observe quality ...

متن کامل

Discriminative Training of GMM for Language Identificatio..

In this paper, a discriminative training procedure for a Gaussian Mixture Model (GMM) language identification system is described. The proposal is based on the Generalized Probabilistic Descent (GPD) algorithm and Minimum Classification Error Rates formulated to estimate the GMM parameters. The evaluation is conducted using the OGI multi-language telephone speech corpus. The experimental result...

متن کامل

The Approach of Speaker Diarization by Gaussian Mixture Model (GMM)

Speaker identification is an important activity in the process of speaker diarization. We need to model the speaker by Gaussian mixture model (GMM) for speaker identification purpose. Large GMM is called as a Universal Background Model (UBM) which is adapted into each speaker model for speaker identification purpose. This paper focuses on speech clustering for speaker diarization. The speaker d...

متن کامل

Performance Analysis of Speaker Identification System Using GMM with VQ

Personal identity identification is an important requirement for controlling access to protected resources. Biometric identification by using certain features of a person is a more secured solution for security identification. Advances in speech processing technology and digital signal processors have made possible the design of high-performance and practical speaker recognition systems. A more...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010